Visualization

Data Visualization is the second step in a Rapid Process Troubleshooting project. During this step, data is visualized and prepared for modelling. Visual and statistical techniques are used to prepare the data that will be used in the following steps to build models. Thus it is important to use good quality data as well as data that is an accurate representation of the process.

In this step, you can:

Set limits for process variables.
Decorrelate the dataset using a correlation matrix.
Visualize data using:
Identify trends and clusters in the dataset with the regional brushing capabilities.

Visualization view

Visualization panel

Accessing the Visualization view

You can access the Visualization view from the troubleshooting context bar located to the left of the Discrete & Batch Troubleshooter project view. The Visualization view becomes active when the minimum requirements for the Data Preparation step are met. These include importing a dataset, and loading the data into the Troubleshooter project.

The Visualization view

The Visualization view provides the following information and options:

Field

This is the field name as configured during the data import stage.
Category

Fields are classified into different categories, such as a target field, a process state, a disturbance, or an adjustable field type.

Histogram icon

Click the histogram icon provided on the list view to view a histogram showing the distribution of the selected field.
Trend icon

Click the trend icon provided on the list view to view a trend against the specified index across the entire dataset for the selected field.
Limits

Select Enable or Disable to enable or disable the limits set for each field. Setting this option to Enabled will set the qualities of all tag values that exceed either the High high or the Low low limits to bad. All bad quality data points will not be used during model construction and only good quality data (data that falls within the High high and Low low limits) will be used for modelling purposes.
Low Low

This is the minimum value contained in the loaded data set for the given field.
Low

The Low limit is a secondary limit that indicates the "operational" lower limit for a given field.
High

The High limit is a secondary limit that indicates the "operational" higher limit for a given field.
High High

This is the maximum value contained in the loaded data set for the given field
Unique

This is the number of unique values contained in the loaded data set for the given field.
For any field containing more than 1000 unique values across the entire data set, the view simply displays ">1000" for the given field.

Visualization Panel

The following options are available as buttons below the field list on the Preparation View:

Multiple Histogram

Allows for opening multiple histograms from a field selection dialog.
Multiple Trends

Allows for opening multiple trends from a field selection dialog.
Field Correlation

Opens the Correlation matrix for decorrelation of the data set.
Remove Models

Quickly remove all models from your project.

NOTE: Limits cannot be set if a model has been constructed. Click the Remove Models button to remove all models from the project.